DE eng

Search in the Catalogues and Directories

Page: 1 2 3
Hits 1 – 20 of 58

1
Understanding Multimodal Procedural Knowledge by Sequencing Multimodal Instructional Manuals ...
BASE
Show details
2
OntoNotes Release 5.0
Weischedel, Ralph; Palmer, Martha; Marcus, Mitchell. - : Linguistic Data Consortium, 2013. : https://www.ldc.upenn.edu, 2013
BASE
Show details
3
OntoNotes Release 5.0 ...
Weischedel, Ralph; Palmer, Martha; Marcus, Mitchell. - : Linguistic Data Consortium, 2013
BASE
Show details
4
OntoNotes Release 4.0
Weischedel, Ralph; Palmer, Martha; Marcus, Mitchell. - : Linguistic Data Consortium, 2011. : https://www.ldc.upenn.edu, 2011
BASE
Show details
5
OntoNotes Release 4.0 ...
Weischedel, Ralph; Palmer, Martha; Marcus, Mitchell. - : Linguistic Data Consortium, 2011
BASE
Show details
6
OntoNotes Release 3.0
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2009. : https://www.ldc.upenn.edu, 2009
BASE
Show details
7
OntoNotes Release 3.0 ...
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2009
BASE
Show details
8
OntoNotes Release 2.0
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2008. : https://www.ldc.upenn.edu, 2008
BASE
Show details
9
OntoNotes Release 2.0 ...
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2008
BASE
Show details
10
OntoNotes Release 1.0
Abstract: *Introduction* Natural language applications like machine translation, question answering, and summarization currently are forced to depend on impoverished text models like bags of words or n-grams, while the decisions that they are making ought to be based on the meanings of those words in context. That lack of semantics causes problems throughout the applications. Misinterpreting the meaning of an ambiguous word results in failing to extract data, incorrect alignments for translation, and ambiguous language models. Incorrect coreference resolution results in missed information (because a connection is not made) or incorrectly conflated information (due to false connections). Some richer semantic representation is badly needed. The OntoNotes project is a collaborative effort between BBN Technologies, the University of Colorado, the University of Pennsylvania, and the University of Southern California's Information Sciences Institute to produce such a resource. It aims to annotate a large corpus comprising various genres of text (news, conversational telephone speech, weblogs, use net, broadcast, talk shows) in three languages (English, Chinese, and Arabic) with structural information (syntax and predicate argument structure) and shallow semantics (word sense linked to an ontology and coreference). OntoNotes builds on two time-tested resources, following the Penn Treebank for syntax and the Penn PropBank for predicate-argument structure. Its semantic representation will include word sense disambiguation for nouns and verbs, with each word sense connected to an ontology, and coreference. The current goals call for annotation of over a million words each of English and Chinese, and half a million words of Arabic over five years. The authors wish to make this resource available to the natural language research community so that decoders for these phenomena can be trained to generate the same structure in new documents. Lessons learned over the years have shown that the quality of annotation is crucial if it is going to be used for training machine learning algorithms. Taking this cue, we ensure that each layer of annotation in OntoNotes will have at least 90% inter- annotator agreement. Our pilot studies have shown that predicate structure, word sense, ontology linking, and coreference can all be annotated rapidly and with better than 90% consistency. *Samples* The following screen captures provide examples of the data contained in this corpus. * English tree. * English sense predicate structure. * Chinese tree and sense predicate structure. *Sponsorship* This work was suppported in part by the Defense Research Advanced Projects Agency, GALE Program Grant No. HR0011-06-C-0022. The content of this publication does not necessarily reflect the position or policy of the Government, and no official endorsement should be inferred. ; Discourse type: narrative
Keyword: computational linguistics
URL: https://catalog.ldc.upenn.edu/LDC2007T21
BASE
Hide details
11
OntoNotes Release 1.0 ...
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2007
BASE
Show details
12
BBN Pronoun Coreference and Entity Type Corpus
Weischedel, Ralph; Brunstein, Ada. - : Linguistic Data Consortium, 2005. : https://www.ldc.upenn.edu, 2005
BASE
Show details
13
BBN Pronoun Coreference and Entity Type Corpus ...
Weischedel, Ralph; Brunstein, Ada. - : Linguistic Data Consortium, 2005
BASE
Show details
14
Extracting Dynamic Evidence Networks
In: DTIC AND NTIS (2004)
BASE
Show details
15
Challenges in Information Retrieval and Language Modeling
In: R. Manmatha (2003)
BASE
Show details
16
Challenges in Information Retrieval and Language Modeling
In: Andrew McCallum (2003)
BASE
Show details
17
TREC-9 Cross-Lingual Retrieval at BBN
In: DTIC (2000)
BASE
Show details
18
Algorithms That Learn to Extract Information BBN: Description of the Sift System as Used for MUC-7
In: DTIC (1998)
BASE
Show details
19
Challenges in Natural Language Processing
In: Computational linguistics. - Cambridge, Mass. : MIT Press 21 (1995) 1, 120-123
OLC Linguistik
Show details
20
BBN: Description of the PLUM System as Used for MUC-6
In: DTIC (1995)
BASE
Show details

Page: 1 2 3

Catalogues
1
3
2
0
0
0
0
Bibliographies
10
0
0
0
0
0
0
0
3
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
40
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern